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INTELLIGENT HARVESTING AND screen size, the presentation of information on any one 

NAVIGATION SYSTEM AND METHOD device is not optimized, for example, for its screen size due 

to the variety of formatting alternatives. 

This application claims benefit of provisional application Most of the prior approaches to wireless content delivery 

Ser. No. 60/160,801 filed Oct 21, 1999. 5 have involved linking a certain browser with a certain 

protocol and a certain mark-up language. There is not a 

BACKGROUND OF THE INVENTION ^uSrd iTat is pe^velhrou^out the multitude of 

This invention relates generally to a system and method wireless handheld devices and information appliances. As a 

for delivering content to information appliances and in consequence of this lack of a standard, content providers are 

particular to a system and method for permitting web pages 10 forced to re-author and re-format their web pages in order to 

in different formats to be communicated to various different generate content for each of these devices, 

information appliances having different display footprints. Compatible languages, such as Extensible Markup Lan- 

Today, people have an unquenchable thirst for informa- guage (XML), a software language designed especially for 

tion that demands instant access at any time or place to the Web documents, have become much more mature and 

information. There has been explosive growth in the hand- 15 permit re-formatting of HTML or XML web pages on-the- 

held computing market and in the cell phone market with Ay *° formats that individual devices can utilize. However, 

over 300 million users worldwide. In addition, there has none of these conventional systems and solutions provide a 

been a continued increase in the number of people with single unified system that permits web pages having differ- 

access to the Internet. These three different markets will cnt formats and mark-up languages to be delivered to 

soon converge as cell phones and handheld computers will different information appliances that may use different 

have web browsers ("mini browsers") integrated therein. To protocols, different browsers, or have different input/output 

promote the convergence, cell phone manufacturers and formats (e.g., different screen sizes). Thus, none of the 

wireless data network providers have attempted to standard- conventional systems provide an intelligent navigation sys- 

ize Internet content distribution with the wireless application tern wherein content may be delivered in a customized 

protocol (WAP). Internet content providers have been slow 25 manner to the different information appliances, 

to adopt these standards and now several major device Another problem with the conventional systems is that 

manufacturers have begun to create their own proprietary content providers have not been able to control the "look and 

standards. feel" of their site using these other solutions so that the site 

Two factors hinder the extension of the Web and is content ^ may look very different on different devices. It is desirable, 

from the personal computer (PC) environment with fairly however, to provide a system that will allow these sites to 

standard display formats to the non-PC based information customize the presentation of their site web pages to the 

appliances and devices. First, re-purposing and converting wide variety of information appliances. Thus, it is desirable 

existing PC-centric HTML web sites to the new breed of to provide a content delivery system and method that solves 

information appliances with drastically varying screen sizes 35 the above limitations and problems with the conventional 

is very problematic. For example, it is not appropriate to put systems and it is to this end that the present invention is 

the content of a PC-centric web site onto a small smart directed, 
phone screen linearly. An intelligent navigation scheme that 
automatically converts content intended for the PC into 

content applicable to one or more different information ^ The content delivery system and method in accordance 

appliances is needed. This intelligent navigation scheme with the invention solves the above problems and limitations 

would optimally vary its output based on the screen size of with conventional systems and solutions by providing a 

the particular information appliance. Second, most current system and method that delivers Web-based content, 

wireless content delivery solutions demand adherence to commerce, enabling transactions, and services to a variety of 

proprietary browsers, proprietary mark-up languages and/or 4S information appliances and devices without requiring the 

proprietary protocols. re-authoring of the content information for display on each 

As FIG. 1 illustrates, there are currently multiple different of these different devices, 

mark-up languages 2, multiple different protocols 3 and ^ ^ In accordance with the invention, the system and method ^ 

different browsers 4. For example, Phone.com has intro- permits content to be input into the system in a variety of 

duced both the HDML and the WML. protocols for cellular 50 different formatting languages. In addition, the system per- 

telephones in the United States, whereas Japan has adopted mits the formatted content to be output in any mark-up 

the I-mode protocol. Palm Pilot devices support a variant of language and protocol, such as WML, HTML, HDML, 

the HTML protocol that uses web-clipping, while Windows XML, etc. Advantageously, each display page on the device 

CE devices support only a limited HTML protocol using may be customized. To organize the content for display on 

special software such as Pocket Explorer. To establish an 55 the devices, the received content information may be 

effective wireless presence, a company with content must mapped into a hierarchy of groups so that the content 

support the multitude of different information appliances 5, information can be optimally formatted for display on the 

the different protocols 3, the different markup languages 2 devices according to thejn^u^ojutput format, suchjsjthe 

and the different browsers 4. disglayscreenja^ of the devices: _ 

The number of devices, protocols and mark-up languages 60 In more detail, the metEod for content delivery may 

create a large matrix of different combinations of devices, include intelligently harvesting content from a web page to 

protocols and languages wherein each combination requires provide that content to a plurality of different information 

a different web server. The rewriting of a site for each , appliances having different screen sizes. The intelligent 
mark-up language and for interfacing with each different 



SUMMARY OF THE INVENTION 



harvesting may convert the content into a proprietary rela- 
protocol and screen size is expensive, complicated and time ek tional markup language (RML) and generate a tree and the n 



consuming. In addition, because each different device may 
have a different input/output format, such as a different 



a document object model from the RML content. The tree 
may then be analyzed and searched using a set of processing 
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rules in order to generate content screens customized to eacb 
information appliance. A typical card builder may build the 
card corresponding to the customized content and a typical 
deck builder may build a deck of cards corresponding to the 
one or more display screens that make up the content for the 
particular information appliance. Hie deck of cards may 
then be converted into a presentation format and protocol for 
the particular information appliance and sent to that infor- 
mation appliance. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagram illustrating examples of different 
conventional wireless content delivery systems; 

FIG. 2 is a block diagram illustrating a content delivery 
system in accordance with the invention; 

FIG. 3 is a functional diagram illustrating a content 
delivery system in accordance with the invention; 

FIG. 4 is a diagram illustrating the translation server of 
the content delivery system shown in FIG. 2; 

FIG. 5 is a block diagram illustrating an embodiment of 
the content delivery system of the invention. 

FIG. 6 is a diagrammatic view illustrating the content 
connection handler of the translation server shown in FIG. 
4; 

FIG. 7 is a diagram illustrating an intelligent harvesting 
method in accordance with the invention; 

FIG. 8 is a diagrammatic view illustrating the XML 
engine of the translation server shown in FIG. 4; 

FIG. 9 illustrates an example of the mapping of an HTML 
web page into an RML object in accordance with the 
invention; 

FIG. 10 is a diagrammatic view of the layout engine of the 
translation server shown in FIG. 4; 

FIG. 11 illustrates an example of a portion of an HTML- 
based web page from the E-TRADE website illustrating the 
grouping of different elements of the web page in accor- 
dance with the invention; 

FIG. 12 is a diagrammatic view of a data structure tree for 
ordering the different groups and atomics of the E-TRADE 
web page shown in FIG. 11; 

FIG. 13 is a diagram illustrating a recursive tree analysis 
in accordance with a preferred embodiment of the invention; 

FIG. 14 is a diagrammatic view of the tree structure 
shown in FIG. 12 illustrating a collapsing methodology for 
processing the tree in order to create cards in accordance 
with the invention; 

FIG. 15 is a diagrammatic view illustrating an example of 
card formats that are created in processing the tree structure 
shown in FIG. 12; 

FIG. 16 is an example of a portion of an HTML-based 
web page from the CitySearch.com website illustrating the 
grouping of different elements of the web page in accor- 
dance with the invention; 

FIG. 17 is a diagrammatic view of a data structure tree for 
ordering the different groupings and atomics of the City- 
Search.com web page shown in FIG. 16; 

FIG. ISA is an example of a screen shot of a Palm Pilot 
device showing a presentation page of the CitySearch.com 
website shown in FIG. 16; and 

FIG. 18B is an example of a series of screen shots of a 
cellular telephone device showing a presentation page of the 
CitySearch.com website shown in FIG. 16. 

DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT 
The invention is particularly applicable to system and 
method for delivering web content to a variety of different 
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information appliances having different display formats and 
different screen sizes and it is in this context that the 
invention will be described. It will be appreciated, however, 
that the system and method in accordance with the invention 
5 has greater utility such as to systems for delivering other 
types of content to devices having different input/output 
formats. 

The content delivery system in accordance with the 
invention provides many advantages over the conventional 

10 systems. As described below, the system permits content in 
a variety of different formats, such as HTML, XML, raw 
data, etc., to be input into the system and then permits the 
content to be output in a variety of different output formats 
and protocols, such as WML, HTML, HDML, XML, etc so 

15 that the same incoming content may be displayed on many 
different information appliances and devices having differ- 
ent screen sizes. In a preferred embodiment, the system may 
use an XML data structure and the system may include one 
or more different software applications that may be based on 

2U the JAVA language. 

In accordance with the invention, content providers do not 
have to change their existing infrastructure or use multiple 
different web servers to send information to various infor- 
mation appliances. In particular, the system receives incom- 

25 ing content on-the-fly from an Internet content provider 
thereby allowing for dynamic information generation. In 
more detail, the system in accordance with the invention 
manipulates standard web pages into a relational markup 
language (RML) that permits, for eacb information 

30 appliance, a "set" of pages that is more useful to the 
individual device and dependant on the device. Now, the 
content delivery system in accordance with the invention 
^will be described. 

FIG. 2 is a diagram illustrating a content delivery system 

35 10 that may include a translation system 12 in accordance 
with the invention. The translation system 12 may be any 
computer system, such as a server or workstation, with 
sufficient computing power to handle the functions being 
performed as described below. In a preferred embodiment, 

40 the translation system 12 may be a server computer that 
stores one or more different software applications that may 
be executed by the CEU°fJhP server in order to implement 
the functions of the content dcliwry. system^ described 
below. The translation server 12 may allow content provid- 

45 ers 13 to deliver their content (in different formats as shown) 
to one or more different information appliances 15 without 
needing to reformat, re-author or rebuild an existing web site 
in order to deliver it to the different information appliances 
15 using different communications formats as shown. It is 

so desirable to provide an upwardly scalable robust server 12 
so that software can be coded without requiring updates as 
server loads increase. Rather, such increased server loads 
may be handled by adding additional hardware components, 
for example. r ncroor y^to the server. It is also desirable that 

55 the server 12 be platform independent to support any oper- 
ating system environment, such as UNIX, Windows, Macin- 
tosh and any other operating system, 
rf In more detail, the translation server 12 may take infor- 
mation directly from an Internet content provider's web site 

60 in various forms, such as HTML data, XML data, or rg w_jdata. 
feedsan d then re-deliver it T via the translation server_ 12juad 
th rou^Tatel eop ^ ^ tffE jft' 81100 **» a Tj jgfcss 

carrier base_sjajipn_that uses aty pical commul gcations 
forma t such as CDP D, to information appliances 13 in a 

65 format .Jh at is complete ly_cjistomizeji.Jp iS-JPffid, user's 
device t ype flpd browsing capabilities. Thus, the content 
delivery^ system and method may generate and output WML, 
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6 



1 



10 



20 



HDML, tiny HTML, compact HTML, HTML or XML data 
that is compatible with the particular information appliance 
15. The information appliances 15 may be any type of device 
including WAP compliant cell phones, Windows CE 
devices, Palm OS devices, and any other HTML browser 
based devices. 

For each wireless device 13, there may be a separate 
telecom system 14, such as different gateways or proxies to 
the different information appliances 15. Thus, in accordance 
with the invention, the content provider 13 may create a 
single piece of content in a single format and the telecom 
system 14 does not have to be modified in order to provide 
the wireless content in accordance with the invention to each 
separate wireless device 15. In addition, each Internet con- 
tent provider 13 also has the ability to customize the 
appearance of their web site pages using the system's web 
based GUI tool. The GUI t ool allows a web p a^e producer 
orj tevalop eLtocreJEj^ 

translation server 12 to describe how a particular web page 
is to be translated by the system 10. In particular, the "drag 
and drop" functionality of the GUI tool in a toolkit that 
allows a producer* level employee to tailor the "look and 
feel" of the site so that, when the web site is delivered to a 
wireless device 15 via the system 10, the same "look mad 
feel" can be delivered as well. Now, more details of the ^ 
wireless content delivery system will be described. 

FIG. 3 is a functional diagram of the content delivery 
system and in particular the translation server 12 in accor- 
dance with the invention. In the example shown, the content 
20 may be an HTML web page. The translation server 12 
may include an intelligent harvester 22, a tree synthesizer 
26, a tree analyzer 30, a card builder 32 and a deck builder 
3S that generate a presentation shoe 38 that may be sent to 
a particular information appliance 15. To intelligently har- 
vest an HTML web page not only involves grabbing the 
content on the site (scraping), but also allows any function- 
ality on that site to be enabled on the target information 
appliance or device. This enabled functionality may include, 
for example, forms, transactions, javascript, cookies, session 
data and security measures. This enabled functionality is 
possible due to the virtual browser (See FIG. 7) that 
provides, for example, javascript and cookie proxy engines, 
so that an information appliance that cannot support javas- 
cript may do so with the javascript being eexecuted on the 
translation server. As another example, an information appli- 
ance may not support persistent session that may be required 
by the web site, but the virtual browser may enable a 
persistent session so that the web site thinks that it is 
interacting with an information appliance that has the per- 
sistent session capability. This process also involves apply- 
ing a set of rules describing the relational context of the 
content (e.g., bow a piece of, for example HTML code, 
relate to each other). 

In more detail, the intelligent harvester 22 may receive the 
content and generate a relational data structure 24 that 
corresponds to the content as described below in more detail 
The data structure containing the content in a relational 
format in accordance with a preferred embodiment of the 
invention is a proprietary relational markup language known 
as RML. RML is an XML based language which has the 
advantage of permitting the easy mapping of the content into 
a tree structure by the tree synthesizer 26 so that the tree 
synthesizer may output a typical document object model 
(DOM) 28. The DOM is a common object model used to 
manipulate markup such as HTML such as it disclosed on 
the W3C web site at http://www.wc3.org. Although it is 
typically used for manipulating HTML or XML, it also to 



provides the tree structure needed by the layout engine as 
described below during the tree analysis functions. 
Generally, a tree data structure is a method for representing 
a hierarchy of data using tree diagrams formed from nodes 
and line segments between the nodes. This may be a bit 
confusing because the DOM's tree structure may be used for 
both its intended purpose of storing the HTML markup 
contained in atomics, but also as a way of storing relational 
information about those atomics as described below in more 
detail 

\ ^ The tree analysis function 30 may receive the DOM and 
then dynamically generate pages for a variety of different 
target screen sizes. Id more detail, a typical, well known 
depth-first tree search algorithm, such as that described in 
the CRC Dictionary of Computer Science, Engineering and 
5 Technology, is used to recurse through the atomics and 
\1 groups with complicated rules as described below for opti- 
mally filling pages with content while creating an intelligent 
navigation scheme in accordance with the invention. The 
Card Builder function 32 may apply the protocol specific 
syntax needed by the particular target device and output a 
presentation card 34. In accordance with the WAP protocol, 
the Deck Builder 36 is needed to put the cards into decks and 
into a presentation shoe 38 to optimize transmission latency 
through wireless networks. The system also permits the 
intelligent interaction of the information appliance with the 
web site since the system may enable functionality not 
typically supported by the information appliance via the 
virtual browser. The functionality of the content delivery 
system will be described in more detail below. Now, a 
prefcrreoUianlw^rc 
system in a^rardan^ 

FIG. 4 is diagram illustrating a preferred implementation 
of the translation server 12 of the content delivery system 10 
in accordance with the invention. In particular, the server 12 
may include a consent, co nnection handler 40 , a layout 
engine 42, an appliance connection handler 44 and an XML 
engine 46. The translation server 12 may also include a 
database 47 that may contain XSL rules used by the XML 
engine 46 for cwrjKej^^ one or 

more URL Ids and various device information. In accor- 
dance with the invention, each XSL rule may be indexed in 
the database based on an ID (the ID may contain a URL, a 
name/value pair and cookie information) so that the system 
may determine which rule applies to which incoming URL. 
The device information may be used by the layout engine 42 
in order to convert the RML data into one or more cards in 
a deck that may be displayed on the particular device. For 
example, the information may indicate the amount of infor- 
mation that may be fit on each screen. The translation server 
12 may also include a long term database 48 that may 
contain cookies so that the system knows which pages have 
been processed previously and a session database 50 that 
stores a presentation shoe for each device. 

Web page requests made by an information appliance IS 
to an Internet content provider 13 may occur using, for 
example the Wireless Application Protocol (WAP), a wire- 
less data transmission standard described in the WAP 1.2 
Specification Suite located at http://www.wapforum.org, 
incorporated herein by reference. Generally, in accordance 
with the WAP protocol, an information appliance 15 may 
initiate a request in WML (Wireless Markup Language), a 
language derived from XML especially for wireless network 
characteristics. This request is passed to a WAP gateway that 
then retrieves the information from the Internet. The 
requested information is then sent from the WAP gateway to 
the WAP client (the device 15), using whatever mobile 
network bearer service is available and most appropriate. 
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The data transmitted over the telecommunications net- 
work 14 (See FIG. 2) is represented as a group of data 
packets and electronically routed over the telecommunica- 
tions network 14 to an appropriate destination node, such as 
an appropriate information appliance 15. The packets of data 
include header information that describes the information 
contained in the data packets, such as the type of data 
contained in the packet, Lc. HTML, voice, ASCII, etc., and 
origination and destination node information, and is used by 
the system 10 to route and configure the data packets for 
transmission through the wireless network 14, according to 
well known network routing techniques. 

In accordance with the invention, when an information 
appliance 15 requests a web page from the content provider 
13, the content provider 13 examines the header information 
contained in the data packets and redirects any non-PC 
requests to the translation server 12. In accordance with the 
invention, content from an Internet content provider 13 is 
received by the content connection handler 40. The content 
connection handler 40 mimics a standard HTML browser, 
such as Internet Explorer or Netscape, and functions as the 
interface with a content provider's web site. Non-PC 
requests are redirected to the transl ation server 12 so that the 
web page information can be translated into a data format 
appropriate for and recognizable by the destination infor- 
mation appliance 15. Desktop PC requests do not need to be 
redirected to the translation server 12 since these devices are 
a ble to display web pages m standard HTML or other similar 
formats a nd do not require customlzc^rwe^p^ie^m^rma- 
tionT 

^ "The appliance connection haodler 44 opeiates as a Web*, 
server for a requesting information apphance 15. T^ appli- \ 
ancc connection handler 44 brokers and controls the entire i 
transaction between i n tteje<]ues^gdeyioe 15 and the trans- j 
latidn server 12. For example, the appliance connection j 
handler 44 may handle the session with each telecom system j 
14 or wireless device 15 and may perform various functions - 
such as establish a session with the information appliance or \ 
wireless device 13, retrieve_pa^e Jnfo j 
content comiectipa _baiiaUer translate recjeiv&Lpages i 
using the X&^ 

described below, and transmit the translated page informa- : 
tion to a requesting information device 15 with appropriate 
header information. The appliance connection handler 44 . 
may also determine state information of the devices 15, : 
synchronize the devices 15, determine browser and protocol 
information and perform various security operations. 

The XML engine 46 converts the XHTML page, gener-' 
atecV by the content connection handler30^1aA.propiigtary 
markup language, RML — the Relatio^^Maiku^Language, 
using XOUg^ Q^^may st oredm a databa^ 47TRMLis 
a markup language written uTXlWL. Eseirpage^type may* 
include 1) a XSL rule-set that specifies what pieces of 
content to display on vary appliances as well as the relational 
structure amongst those pieces of content; and 2) a_perl 55 
script. XSL is a language for traiisforming an^XN^dogj T 
m cnunto anotner XSftL c&cu ment as described at the W3C 
website lo ^tcd^ajJu^p^w ww.wScon ^. The XSL rulcscts - 
used Dy the XML engine 46 may be stored in a database 47 
indexed by ID page-type identifiers. The XSL rule-set may 
be used to process hierarchical elements of the content while 
the perl script may be used to process unstructured markup 
in the content. In accordance with the invention, a unstruc- 
tured tag may be used to mark the markup that may be 
processed using the perl script. Each page-type has a rule-set 
and possibly a perl script associated with it. In accordance 
with the invention, the XSL rule-set and the perl script may 
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be generated manually or may be automatically generated. 
The information stored in the database 47 can be used by the 
XML engine 46 when converting page information. 
1 1 The layout engine 42 processes the content to convert the 
relationaJURL^^ received from the XML engine 46, 

intSTdevice and protocol specific mark-up language formats. 
Formatted output provided by the layout engine 42 will be 
referred to herein as a "presentation shoe.'* For purposes of -j 
this description, a presentation shoe consists of "decks'* and 
"cards." A deck is the smallest unit of content that is sent to 
a device. In certain protocols, such as WML and HDML, a 
deck can contain multiple cards. The presentation shoe thus 
contains the original HTML content of a webpage that is 
reformatted into an appropriate format language and tar- 
geted at an information appliance 15. The presentation shoe 
is formatted specifically for the target appliance's screen 
size, user interface and protocol Advantageously, tlje_pj£- 
sentation shoe js^rcated dynamically in accordance with 
relalio^ be 
described in detail herein. Information about the presenta- 
tion shoe can be stored in a session database 50 that may be 
in communication with the layout engine 42. Now, the 
details of the operation of the translation server will be 
described. ■* 
FIG. 3 is a block diagram illustrating operation of the 
wireless content delivery system 10 of the invention. As 
described above and shown in FIG. 5, an information 
appliance 15 may request page information of a particular 
URL website from a content provider 13. The request by the 
information appliance 15 may be redirected, for example, 
via a JAVA servlet 60, to the translation server 12. The 
appliance connection handler 44 examines header informa- 
tion from the requesting data in order to determine a target 
device 15, protocol and browser configuration. The appli- 
ance connection handler 44 then requests the desired URL 
information from a content connection handler 40. The 
appliance connection handler 44 may relay information 
about the requesting wireless device 15 to the content 
connection handler 40 such as a requested URL address of 
a content provider 13, a session ID, a user ID and a wireless 
device's browser capabilities. The content connection han- 
dler 40 retrieves the requested information from the content 
provider 13, renders any JavaScript, stores any cookies and 
returns the requested information as XHTML data to the 
appliance connection handler 44. 

The appliance connection handler 44 then requests the 
XML engine 46 to convert the received XHTML data to 
RML data and assign atomics into the relational structure 
according to page type so that presentation cards can be 
created and placed in a presentation shoe so that the cards 
can be transmitted to the target device 15. The XML engine 
46 will be described in more detail below with reference to 
FIGS. 7-9. 

The application connection handler 44 then conveys the 
RML data output from the XML engine and device infor- 
mation (stored in the database 47 shown in FIG. 4) to the 
layout engine 42 so that the layout engine 42 can generate 
a device and protocol specific set of cards that are served to 
the requesting appliance 15 by the appliance connection 
handler 44, via the presentation shoe. 

Id more detail, the layout engine 42 may include a layout 
processor 62, a preprocessor 64, a recursenode module 66, 
a card formatter 68, a deckbuilder 70, a contentcutter 72, a 
ordernodes module 74, a guidehandler 76, a recursenode and 
recurse atomic module 78, 80, a navigation builder 82 and 
a shoebuilder 84. The layout processor 62 may receive the 
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XML DOM, the device information from the database 47 appliance. This function is perfor med by an HTTP client 
and the shoe ID and may forward the shoe information to a connection module 90. The HTTP client connection module 
preprocessor 64. The preprocessor may forward the XML 90 functions to maintain a communication session with the 
DOM and class information to a content cutter 72 that content provider 13 and handle cookie and security (SSL) 
operates on the RML data to remove any classes of infor- 5 information. An HTML data processor 92 may render Java- 
mation that cannot be displayed on the particular infbrma- Script data in a proxy fashion for the requesting information 
tion appliance or wireless device 15 based on the device appliance 15. With a scripting language, such as JavaScript, 
information from the database 47. Once the classes are the client-side source code is embedded directly into the 
removed, an ordered structure of atomics is generated from HTML page and a client-side software plug-in that interprets 
the RML data by the OrderNodes module 74 so that, for 1Q that language is automatically activated while the HTML 
example, a tree structure describing the relation of different page is being displayed. This client-side software is not 
portions of the content information to be displayed on the available on most information appliances, requiring our 
device 15, can be created. A GuideHandler module 76 servers to proxy this function. An HTML cleanup module 94 
permits the system to put a template on each card of the first may generate XHTML page information so that the trans- 
card in a shoe delivered to an information appliance. For 15 la tion server 12 can translate the received HTML page 
example, the logo of a company may be displayed in the information to a format readable by and appropriate for an 
comer of each page displayed to the information appliance. information appliance 15. Now, a method for intelligent 
The navigation builder 82 may permit links between cards harvesting in accordance with the invention will be 
and other items to be placed on a card. For example, a described that includes some of the elements described 
particular web page may always have one or more naviga- 2Q above. 

tion links on each page (e.g., support, products, etc.) that FIG. 7 is a diagram illustrating a method for intelligent 

may be placed on each card by the navigation builder. harvesting of web pages in accordance with the invention 

The RecurseNode module 66 and 78 may then operate on that uses the elements shown in FIGS. 3 and 4. In particular, 

the tree structure to group related content information so that the content 20 is fed into a virtual browser 1C0 as described 

appropriate cards can be created for presentation on the ^ briefly above which passes on the content to the XML 

wireless device 15. The recursion of the nodes in the tree is engine 46. Using the particular page-type rules sets stored in 

described in more detail below with reference to FIG. 14. In the database 47, the XML engine 46 may convert the content 

processing the tree structure, the card formatter module 68 into relational markup language (RML) format 24. In more 

may generate presentation cards that include the appropri- detail, a session on the Internet is handled by the virtual 

ately grouped atomics as determined by the recursenodes ^ browser 100 located on the translation server 12. This virtual 

and recurseatomic modules 66, 78, SO. The deck builder browser provides the important functionality of proxying 

module 70 may then groups the plurality of presentation javascript (using a Javascript proxy engine 102) and cookies 

cards into decks so that the deck can be organized into an (using a cookie proxy engine 104) for the target devices. For 

appropriate presentation shoe to optimize the wireless link each page-type, an XSL rule-set is stored (See the W3C 

between the system and the information appliance. To 3S specification at http://www. w3.org/TR/2000/WD-xsl- 

pcrfonn the above function, the deckbuilder 70 may include 20000112) in the database 47. This rule-set contains infor- 

the navigation builder 82 that builds the navigation path mation about the division of content into atomics, content 

between the cards in the deck and the shoebuildcr 84 that classification, and the relationships amongst these atomics 

actually builds the presentation shoe. for the particular page. The XML Engine 46 uses these XSL 

Once the presentation shoe is completed by the layout 40 rule-sets to translate the HTML into RML. The division of 

engine 42, it returns the presentation shoe to the appliance content into atomics can be generated either automatically or 

connection handler 44. The appliance connection handler 44 by a producer using a tool set. The content classification 

then may relay the presentation shoe, for example via the information is used to determine what content and function- 

JAVA servlet 60 to the appropriate requesting wireless ality are appropriate for which classes of devices. The 

device 15 so that the content information from the content 45 content classification information may be generated either 

provider 13 can be displayed in a format appropriate for the manually or automatically. Now, more details of the XML 

wireless device IS. engine in accordance with the invention will be described. 

FIG. 6 is a diagrammatic view illustrating the function of FIG. 8 is a diagrammatic view illustrating more details of 

the content connection handler 40. The content connection the XML engine 46. As described above, the XML engine 46 

handler 40 establishes an on-line session with a web site in 50 e xtracts content from dynamic ally changing XHTML„infprr 

order to receive webpage content to be converted. In a mat ion and generates a correspol^^ 

preferred embodiment, HTML pages are retrieved from a RML file, in accordance with XSL 

specified URL website and the retrieved HTML information ralc^te dkej^ used to convert 

is formatted as XHTML, an XML compliant HTML format between Xffln^'and""RKC. An j 

utilized by the translation server 12 and represented by a 55 example of an XHTML document bcin^colE^iie3"'into I 

document object model (DOM). A DOM is a common object RML using an XSL rulesets is described below. ^ J 

model used to manipulate markup. This structure is used to ^ In operation, the XML engine 46 m ay receive a papc-typc, 

translate the HTML page information to a format that can be ^designated by URL, name/value pairsTand cookie informa- 

translated by the XML engine 46 to be described in detail tion and fl&ges of XHTMITinTormatfon fom lhTc ontent 

below. The content connection handler 40 may communicate go cohnection-jiaiicUef ^ 112 

with a long-term database 48 (See FIG. 4) that is configured mayrecewe^rta^^ 47^ / 

to store cookie information about the web sites from which defjg e3jgwnhe~m^^ the URL website is" to be>> / 

page information is retrieved. co^^^rDrJ^rsE^^Eru^BeTs may belised [dcp&o^agQn I 

As described above, the content connection handler 40 mft~6QOT&riQ&^ j 

functions as the interface with a content provider's website. 65 naLHIML page informati on bag been received. laconjunc- J 

In a preferred embodiment, it acts as a virtual browser, tionjyith the rulesets determined 07 the hasbtable 112, an 

proxying browser functionality on behalf of the information XSL transform processor 110 may convert the received 
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XHIJ^iiiformation to RML informadoa tftat ptWf<Sed 
to the layout engine 42 so that it can be convertedWo^evke 
and protocol specific mark-up language formats. 
Additionally, the XSL rulesets may permit the Internet 
content provider 13 to control the look and feel of the 
content (e.g., the location and/or inclusion of certain ele- 
ments of the content together on a single card) regardless of 
the wireless device 15 on which the content is being dis- 
played. Now, an example of an HTML page prior to 
conversion, an XSL ruleset that may be used to convert the 
particular HTML page and the resulting RML page in 
accordance with the invention will be described. 

Below is an example of an HTML web page that may be 
converted in accordance with the invention into RML. 
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-continued 



SAMPLE HTML CODE FOR WEB PAGE 

<htm]> 
<head? 

<title>Welcome to Foo.com I <AiUe> 
</bcad> 

<body> \ 
<p><a hxef- K http^Mww.foacom/logia/ n >Log In</a></p> 
<p><a hrc&>"httpJ/www.£oaa>in/sigmip^>sigD Up<Jn><Jp> 
<hl aligSB*centef*>Foo^oin</hl> 
<h4 align-"center**>Your source for all thing? foo!</h4> 
•dable> 
<&> 

<th coUpaii-"2 M align-"ccQter">Foo Prodacls<Ah> 

<ti> 

<td>Foo Fighter</td> 

<dd>$19i>5</td> 

<ftr> 

<tr> 

<td>Foo Peacemaier</td> 
<td>$29SS</td> 

</tr> 
<tr> 

<td colspaB-**2'*> 

«a hrcf="httprwww.6oo.com/buy w >Buy these 

wonderful Foo's!</a> 

<Ad> 

</tr> 
<Jtab\c> 

<p>(c) 2000 Foo.com </p> 
</body> 
</html> 



<^pauo 
</navig»tioil> 
^ </guidc> 

<group name-** Main" class-" 1" seqnential«'^equeiitial n > 

<nl:npply- templates sdect-**body7> 
</group> 
</rml> 
</rsl:tempiate> 
10 <3El:tempIate inatch-"body**> 

atomic iiame-Title" class-M**> 

<b><xBlrvalue-of select-"h47x/b> 
</atonrio 

<xal:apply-tcmplatc3 select-** tablc'7> 
</xsllemplato 
15 <xsl:template matcbV 4 table"> 

<group namc-Tby Table" dass-T^ 

<atomic name-Table Title*' class-" 1" sequential=* , sequential M '> 

<b><XBl:value-of fielect- M tiAh7></b> 
</atomic> 

<xsUfor-each selecf>"U(td[not(®colspan)ir*> 
<group name-Toy* dase-"!*^ 
<Dl:fbr-each se!ect- H td n > 
<atomic name— "Entry" > 

<xsl:va]u©-of select-**. 7> 
</atomio 
</xsl:for-each> 
</group> 

23 </xsl:for-cach> 
</group> 
</zsllempLate> 
</2Ql3tyIesheet> 



Z 30 The resulting RML code in accordance with the invention 
' is set forth below: 



<IDOCTYPE html PUBUC V/W3Q/DTD HTML 4.0 Transi- 
tional//EN"> 
<rml> 
<head> 

<tille> Wei come to Foo.com! <Aitfe> 
</bead> 
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Now, a XSL mleset that may be used to convert the above 
HTML page into RML in accordance with the invention is 
set forth below. 



45 



<xsl:styte&heei xmlns^l-"bttp^A?ww.w3.org/19^/XSlVTraniform"> 
<xsl:tempiate match- M *^V<^a^y^ten9latea/>^xal:teinplate> 
<xsl:template match=»**teilO{@ - "><xsl^atue-of selecfc--.7x/XBl:tcm- 
plate> 

<Z8l:template matcb-*'htmr> 
<nnl> 

<head> 

<tiile><XB I rval ue-of select-7Aitle7></title> 
</hcad> 
<gnide> 

<navigation> 
<pane> 

<xsl:for-each sdect-*'bc<iy/p/a"> 

atomic Damc-"Goto^ class-"!" 

column-* 1 column H > 

<a href-** {@hre f} "xatsl rvalue-of 

select^**nonnali2e-space(.)7></a> 

</atomic=> 
</xsl:for-eacrj> 
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<panc> 

<atomic name— "Goto:** class—** 1** column— **oolumn"> 

<a hxef-"httpVAvww.mo.com/login/'>Log ln</a> 
</atomio 

•catomic name— "Goto:" claii-~l** column- **oohimn"> 

<a href-" http -Jfwvrw. foo . com/8ignupr*>Sign Up</a> 
</atomio 
</pane> 
<ynavigation> 
</guide> 

<group name— "Mais'* class- M l** sequenrial-"sequentiar'> 
<atomic name-Title "dflsi- , 'l'*> 

<b>Your source for all things foot^/b> 
</atomio 

<grocp name-Toy Table" dass» M l**> 

<atomtc name-Table Title** class-"!** scquc Diiol- M sequential "> 

<b>Fbo Products <ft» 
</atomic> 

<group name-Toy Table" claaa-**l"> 
<atomic name-**Entry">Foo Fighter </atomic> 
<atomic name-"En*jy">$ 19.95 <Motcuo 

</group> 

<group name-To y" dxsB= M l**> 
<atomic name-"Eotiy">Foo Peac«maker<Momic> 
<atomic nftme-*Tntry**>$29.95 </aiomio 
</group> 
<ygzoup> 
^/group> 
</rml> 
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The above RML code may also include the unstructured 
tag as described above that m surrounds a particular portion 
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of the mark up language that is unstructured and may be 
processed using a perl script Now, a graphical example of 
HTML content being converted into the RML in accordance 
with the invention will be described. 

FIG. 9 is a diagram illustrating an HTML page 12Q being 
converted into an RML page 122 in accordance withlheT 



inyep h onTT br purposes of the conversion, each piece of 
discrete content with presentation structure or no structure 
124 is mapped into an atomic 12 6 in the RML code. It should 
be noted that thd hyperlinks within a piece of content are 
maintained in the atomics. In addition to the atomics, groups 
of content 128 in the HTML page may be converted into a 
group 130 in RML In this example, the group is non- 
sequential (e.g., the content in the group does not need to be 
displayed sequentially). Now, an example of the division of 
a graphical web page into atomics and groups in accordance 
with the invention and the layout engine 42 will be described 
in more detail. 

FIG. 10 is a diagram of the layout engine 42 in accordance 
with the invention. As described above, the layout engine 42 
formats a content source for a specific device's screen and 
inherent capabili^sjhe layout eng5»^2may include the" 
content cutter 72, the layout processor 62, the card builder 
68, and the deck builder 70. ^c^sS£^^}^7Zj^^^\M 
cc^tentof format and content classes not appropriate for the 
speciBc~3evice frQmJhe-jpece^ iTT^f^ mgc to create an 
XML representation of the reccwff*onSnarwebpage. The 
layout processor 62, using prior knowledge of the device 
type and the content, dynamically devises an optimal layout 
and navigation structure for the particular device 15. Thus, 
the output of the layout engine 42 is a presentation shoe in 
the appropriate presentation protocol for a particular appli- 
ance 15. 

Presentation shoes arc built from the bottom-up, resulting 
in the highest priority "atomic" being placed on a card first. 
An atomic is the smallest unit of a web page that encapsu- 
lates an idea. For example, an atomic may be a paragraph of 
text, a heading, a link to a news story, a picture, etc. Atomics 
may be grouped together to reveal relationships between 
them. Groups may be nested to form a complex relational 
hierarchy. These groups can be placed on cards so that 
customized presentation pages can be transmitted to a device 
15. 
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171, 172, 173 make up the root group 170. These groups 
constitute the relational hierarchy for this portion of the 
E-TRADE website. 

To display this website on a display device 15, such as a 
Palm Pilot or a Windows CE device, the groups and atomics 
need to be organized and placed on cards that make up the 
presentation shoe. Cards are created by examining how 
groups best fit onto the cards. A tree data structure can be 
generated from the RML object As described above, nesting 
groups describe the relational context of content contained 
in a webpage. In RML formatting, each group typically has 
three primary attributes: name, class and priority. The name 
attribute is used during a navigation pass through the tree as 
a link description to the group if needed. The class attribute 
is used by the content cutter 72 to determine the content in 
the webpage that is appropriate for a specific device's 
capabilities. Thus, the class attribute allows different levels 
of content to be presented to different classes of devices. For 
example, the general classes of devices are shown in the 
following table, but the number of classes may be increased 
or decreased. 



25 



Class General Device Class 

1 Cellular telephones - very limited screen size 

2 Palm Pilot - low resolution, black and white or monochrome 
display 

3 Windows CE - high resolution, color display 
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The priority attribute indicates the importance of each 
portion of the content and is used by the layout generator 42 
during preprocessing to order the content into appropriate 
groups. To further organize the groups, each group also has 
Boolean attributes, such as columnize, sequential and other 
attributed such as association. The association attribute may 
have the values of keep together (keep the atomics together), 
isolate (isolate the atomic on its own card) or null. The 
columnize attribute indicates whether an atomic/group 
should be placed adjacent another atomic/group on a card. 
The sequential attribute specifies how the navigation struc- 
ture should be designed. For example, atomics containing 
paragraphs of a story would be grouped as sequential 



information. In contrast, a list of links to news stories would 
In operation , an RML document i s received by the layout \ s be grouped as non-sequential information. The keep 
^ * * 1 ~"~ t " ~~ J ' together attribute indicates to the layout engine 42 to keep 



engine 42 from which the content cutter 72 cuts data classes 
t hat are not appropriate f or the requesting device 13 Jo ^ 
generate an XML document containing the cut content, 
'device information that specifies the target device, and 
protocol information that specifies the target protocol. 

Card creation will be described with reference to FIGS. 
11-13. An example of a portion of an HTML web page 170 
from the E-TRADE website is shown in FIG. 11. In the 
Figure, the innermost dashed boxes designate atomics while 
the boxes enclosing them constitute, groups. At the top 
portion of the page 170 is a quote look-up fionn 171. The 
quote look-up form 171 is made up of three atomics, a 
"Quotes" tide portion 17Li, an entry box_Hlfejind * "Go" 
submission button 171c. Further, the jnaritcLgraph JJ2a, 
table .1726, and Fool.com a^^^mi?Jit.JL7Zc^&it each 
related atomics and are grou^ 

172, In addition, each elemenfin the market graph 172a may 
also be an atomic so that "NASDAQ" is an atomic, 
"2756.27* is an atomic, the down arrow is an atomic and 
"-5.48" is an atomic. Finally, the TheStreetcom logo 173a, 
and the four news stories 1736-c are each related atomics 
and are grouped together as group 173. All of the groups 



children of a node together on the same card if possible. The 
isolate attribute specifies that the group is be placed onto its 
own card. 

50 In addition to the above, knowledge of the number of 
characters and pixels horizontally and vertically on the 
target wireless device 15 is desirable. In addition, for devices 
15 that allow fonts, a font width calculation may need to be 
made. A bandwidth and screen dependent variable may be 

ss assigned to determine how much content is allowed to scroll 
before a new presentation page is created vertically and a 
link created to subsequent presentation pages. Another vari- 
able may store the minimum container width (how narrow 
the text container can be made so that another container can 

«j be put along side it). 

U \ FIG. 12 illustrates the relational structure of the groups in 
the E-TRADE web page shown in FIG. 11 that may be 
rep resented in a tree structu re. This separation of content 
fromstyle in a relational way that can be represented in a 

65 tree allows for the re-formatt ing of thecontent in accordance 
wi|±QhBjnvcnJwn In the Figure, a root elcment node 180 
refers to the outermost group of the tree and indicates the 
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E-TRADE web page 170. The nodes 182-186 (roman ) is passed to Recurse Atomic step 208. In Recuise Atomic, an 
" " atomic object is created using the node's contents. This 



numerals I, II, and III) are the "children" of the root node and 
refer to the atomics 171a, 1716, 171c of the Quotes portion 
171 of the E-TRADE web page 170. A node 188 (roman 
numeral IV), another child of the root node, refers to the 
market group 172 and includes children nodes A-C (market 
graph 172a, table 1726 and Fool.com advertisement 172c of 



Atomic is then placed on to the current pane in step 210. 

There are certain circumstances, such as cohimnizable 
atomics, where the current pane is added to the card and the 
current atomic is placed in a new pane. A cohimnizable 



FIG. 11). Similarly, a node 190 (roman numeral V) refers to » , ? nuc » « placed adjacent to another atomic, 

the news section 173 and includes children nodes A-E (the I Wben M of ,he chfldren of a group have been passed to 

news story links 173a-* of FIG. 11). The flexibility of trees i „ Reause Node m 212 ' RccuBe Grou P 206 men ,ries to 

and directed graphs allows them to represent any structure f add cumal pane to the current card in step 214. If the 

found in a web site. Thus, the trees provide a method to i fits on & c card ( mc test occurs in step 216) then 

organize the complicated relationships amongst content in a I Recurse Grou P rcturas m step 218 to Recurse Group 206 and 

web page for formatting as well as provide navigation links 1 ** recursion continues. If the pane doesn't fit on the card, 

between cards in a presentation shoe. an exception is handled in Recurse Group where one of two 

To provide a simple tree structure that can be efficiently , c "f. b, «^" P T """V" ^ ue ° tial atomics f 

navigated, me tree is sorted by priority. This pre-processing <" *** m "* 220 >' ^ ' he pine * ffld 1 ,s m ^ v of 

r(1/ ,T„ 1 f . . # . , 4 ,/ 4 t, & its atomics that can fit on the current card are placed into a 

reorders the tree to put the content that will appear on the . 4 t , . „- A . 

<4«*nn. ic * 4 .li ft . j * AJi new pane and put onto me card (steps 222. A link to the next 

device 15 first m the left-most nodes of the tree. After the A K , , *V . , ... . ... 

♦™ u«, L. a _ . j „ » * i.j- card is placed at the end of the full card and then the card is 

tree has been sorted, the groups and content are evaluated in - n , , f it _ , , , . . ... _ 

order to assign the content to one or more cards using the M placcd Z^L ^ * ^ ls ( crc f led J 24 - 

attributes described above. Adepth-flrst search algorithm as «f remauung cards is put onto the new card in 

, -» i , . , , . , . . step 2Z6 and the recursion continues in step 223. 

described above, using a recursion technique that considers F F 

outgoing edges of a vertex before any neighbors of the If lhe P anc contains nonsequential atomics (as tested in 

vertex, may be used to implement this evaluation as „ ste P ^n 106 current node is cloned (duplicated). Then, 

described below with reference to FIG. 13. As this search sequential atomic nodes that are links to the original children 

evaluates each vertex, it attempts to optimally fit the atomics 416 created ^ step 230 **** added as children to the original 

onto cards. node wa ^ c the original children are removed. The cloned 

FIG. 13 is a flowchart illustrating a method 20® for node contains the original children. The Recurse Group step 

recursing through the tree generated by the XML engine in 30 20S then iterates tluough these original children by passing 

order to generate customized panes and cards that may be them to the Recurse Node object 202. If no exceptions occur, 

displayed on an information appliance or wireless device 15 Recurse Group 206 then iterates through the original node's 

in accordance with the invention. The recursive method new children. This iteration is handled in the same manner 

starts with the root node—the top node in the tree (See FIG. 35 me on S mal Nation through the group's children. Once 

12). In this case, the root node is the top-level group node 35 of the atomics have been placed in panes and these panes 

that encompasses all of the content that shall be placed into pUced m cards > recursion should return to the root node 

cards. The goal of the recursion is to place content into cards and the rccursK)n ends * 

in such a manner that the context of the content remains FIG. 14 illustrates a collapsing methodology for process- 
while creating an intelligent manner of accessing the content uig the tree to create cards that can be transmitted to a device 
(an intelligent navigation scheme) for each different infor- 40 15 * The recursion of the tree as described above begins at the 
mation appliance or wireless device that may have different root node and proceeds to its first child node 182 (node I), 
display capabilities. As described above, all of the content the "Quotes" text Ilia. Anew cohimnizable pane is created 
resides in atomic nodes and all atomic nodes are children of anfl " atomic I (171a in FIG. 11) is added to it. This pane, as 
groups. An example of the groups and atomics are illustrated *t is closed, is added to a newly created card. The recursion 
in FIG. 11. Groups can contain other groups or atomics 45 process then continues to the next node 184 (node II), the 
while atomics can only contain content, for example, entry box 1716, and adds this atomic to a new cohimnizable 
HTML. pane. Similarly, node HI, the "Go" submission button 171c, 
FIG. 13 is an example of how the recursion method may fc added to a new columnizable pane. These panes are passed 
be accomplished but many other implemenutions are pos- back t0 root n °d e and a card 240 is created that includes 
sible. Recursion through the tree representation of the web 50 me lnree pan es - 

site begins at the root node. The depth-first search proceeds The recursion process continues to node 188 (node IV), 

to the first node without any children. This is the first atomic the market group 172, and immediately to leaf node A, the 

having the highest priority. In addition to the previously market graph 172a. A new pane is opened and atomic A is 

described attributes, two additional attributes arc relevant to added to it. The process continues to leaf node B, the market 

the creation of cards that are placed in the presentation shoe., 55 table 172b. Atomic B is added to the pane, as is atomic C, 

"panes" and "frames." Multiple atomics can be placed, top according to the same processing, and the recursive process 

to bottom, into a pane. Panes can be assigned to cards continues through the tree to node 190 (node V), the "News" 

adjacently (columnized) by placing them into two frames section 173. A new card 242 is created that includes atomics 

side-by-side on the card. A-C of group IV. Similarly, atomics A-E (news stories 

The recursion for the tree starts by passing the root node 60 173a-e of FIG. U) are placed on a third card 80c and the 

to the Recurse node object 202 which determines what kind recursive process completes. 

of node (e.g., a group or an atomic) is being manipulated in The output of the card generation is shown in FIG. 15. 

step 204. In the root node's case, the passed node is a group Each of the cards 240, 242, 244 is shown. As shown in the 

and it is passed to the Recurse Group object 206. The Figure, the first card 240 includes atomics 171/1, 1716, 171c 

Recurse Group object iterates through the node's children, 65 from FIG. U wherein each atomic is in an individual pane, 

passing each of them back to Recurse Node 202. In the case The second card 242 includes atomics 172a, 1726, 172c 

where the node Recurse Node receives an atomic, the node from FIG. 11 all grouped as a single pane on the card 242. 



06/25/2003, EAST Version: 1.04.0000 



US 6,430,624 Bl 



17 



18 



The third card 244 includes atomics 173a— e grouped as a 
single pane. Whether atomics may be grouped on a card as 
a single pane or in individual panes or any combination 
thereof depends primarily on the contextual information 
relating to the atomic. 

Navigation amongst these cards is determined by the 
sequential attribute of the root node. Thus, in the non- 
sequential case, a card with links to the three cards 240-244 
shown in FIG. 15 would be created. In contrast, for a 
sequential case, navigation between the cards 240-244 
would begin at the first card 240 and links at the bottom of 
the first and second cards 240, 242 would lead to the next 
successive card (e.g., the second card 242 and the third card 
244) respectively. As mentioned above, the navigation 
between cards can be determined from the tree. 

The recursive process described above was simplified 
because each child of the root node fit on am individual 
page. The following description, with reference to FIGS. 
16-18, illustrates a more complicated card creation process 
in which the dynamic feature of the layout engine 42 are 
shown. 

FIG. 16 illustrates an example of a web page 290 from the 
CitySearch.com website. In the Figure, two different group- 
ings are indicated. The first grouping 291 includes the 
"Review of the Week" contextual information. In the group 
291, four atomics 291a-J are shown that reference the title 
291a, text 2916, picture 291c and mini review 2914. The 
second grouping 292 includes the "New Releases" contex- 
tual information. En the group 292, there are different num- 
bers of atomics for different sub-groups. Each of the sub- 
groups 293-296 are grouped according to the different 
movies. Atomics within these subgroups represent the 
names 293a, 294a, 295a, 296a of the movies, pictures 2936, 
294b, 2956, 2966 of the movies and mini reviews 293c, 
294c, 295c, 296c of the movies. Sub-group 293 also includes 
another atomic 293d referring to the title "New Releases." 
These groups and sub-groups constitute the relational hier- 
archy for this portion of the CitySearch.com website as 
determined by the XML engine 46 in accordance with the 
invention using the XSL rule set and perl script for the 
particular web page. 

The tree structure for the CitySearch.com web page 290 » 
is illustrated in FIG. 17. In the Figure, the root node 300 " 
("1") indicates the CitySearch.com website group. Child 
nodes 302, 304 (Nodes 2 and 3) indicate the two groups 
"Review of the Week" 291 and "New Releases'* 292. The 
atomics a-d of node 2 refer to the title 291a, text 2916, 
picture 291c and mini review 291o* of the "Review of the 
Week" group 291 of FIG. 16. The child nodes 306-312 of 
node 3 (nodes 4-7) and their associated atomics a-c of the 
tree represent the different movie sub-groups 293-296 
included in the "New Releases" group 292. 

In accordance with the invention, the layout engine 42 
recursively iterates through the relational tree structure, 
dynamically building cards for an appropriate screen size of 
a target device 157 As described above, the recursive process 
may incorporate a depth-first recursive algontnm to lmple- 
ment pfo^^iuguf ^ 

begins at node _l^the root . npdejof .thejmjand gfoo^^to*" 
node 2, theJ^Reviewffi 
tb e^higbest priority node that doe s, ,00! have an^, ffij pciate d 
chuteQ. The process conunues to node 2a, the title portion 
291a of the group 291. Atomic 2a is placed onto a pane "and 
the pane state is set to non cohimnizable. Next, the recursion 
process proceeds to node 26, the text portion 2916 of the 
group 291. The atomic state of the text portion 2916 is 
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compatible with that of the title portion 291a so the atomic 
2916 is added to the current pane. This process continues, as 
described above, until the entire tree has been processed and 
the atomics have been placed in panes on cards. 
5 '\p Depending on the target device 15 screen size capability, 
different presentation screens may need to be generated by 
the system 10 so that the information may be viewed on the 
device 15. For example, for a Palm Pilot device, the con- 
textual information associated with node 2, the "Review of 
the Week" group 291, may fit entirely on the display screen 
so that the associated atomics may be included on a single 
card However, the "New Releases" group 292 and its 
associated subgroups 293-296 may not fit on the card and 
new cards may need to be created in order to display this 
information on the Palm Pilot. Since node 3, the "New ~ 
Releases" group 292, is a non-sequential group, new cards 
for each of the subgroups 293-296 may be created and the J 
appropriate links may be i nserted into the cards so that a 
vie wer of the information on the clevice carT navigat e 
between display jp ages. An example of the presentation 
information of the CityScarch.com web page 290 shown on 
a Palm Pilot device is shown in FIG. 18A. — ' 

In contrast, for a cellular telephone device that has a 
limited display screen size, a different alternative of display- 
ing presentation information is possible and automatically 
generated by the system. In particular, depending on the size 
of the display screen, the system 10 may determine that 
cards are divided at different points along the creation 
process and the appropriate links inserted thereon so that a 
viewer of the presentation information on the phone device 
can navigate between the different display pages. FIG. 18B 
shows an example of the presentation information of the 
CitySearch.com web page 290 shown on a cellular tele- 
phone device. In particular, the information displayed on the 
full screen display of the Palm Pilot (See FIG. 18A) are 
broken into one or more smaller screens wherein, for 
example, the user of the phone must select the "Go** button 
to move down through the various titles of the movies that 
are reviewed. 

While the foregoing has been with reference to a particu- 
lar embodiment of the invention, it will be appreciated by 
those skilled in the art that changes in this embodiment may 
be made without departing from the principles and spirit of 
the invention and defined by the appended claims. 
What is claimed is: 

1. A system for intelligently harvesting information from 
a data source for one or more different information appli- 
ances having different input/output formatting capabilities, 
comprising: 

means for receiving web-based content information of a 

first input/output format; 
means for translating the received content information 
from the first input/output format to a different input/ 
output format that is recognizable by a specific device; 
and 

means for providing the translated content information to 
the device. 

2. The system of claim 1 wherein the device is a wireless 
device having a predetermined display screen size. 

3. The system of claim 1 further comprising means for 
receiving information from a device that indicates the input/ 
output format of the device so that the translated content is 
generated based on the input/output format of the device. 

4. A system for translating content information from a first 
formatting language to a second formatting language, com- 
prising: 
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a host system including content information of a first guage can be generated from the remaining content infor- 

fonnatting language; manbn of the second intermediate language; a layout pro- 

a translation server remotely connected with the host ccssor for dynamically generating a layout format for the 

system, the translation server configured to receive the content information of the second formatting language that 

content information of a first formatting language and 5 is optimized for the display screen size of the device; and a 

translate the received content information from the first protocol processor for generating the content information of 

formatting language to a second formatting language; mc second formatting language. 

and 17. The system of claim 16, wherein the layout format 

a device remotely connected with the translation server comprises a presentation shoe that includes at least one 

the device configured to receive the content informa- 10 P^ 0 ^ 00 ^ eacfa presentation card representing a 

tion of the second formatting language and display the dis ? lay pagc °- f thc cach P«wntotion card including 

content information least a portion of the content information of the second 

3. Hie system of claim 4, wherein the device is a wireless f 0 ™ 1 * 1 * ^rein subsequent presentation cards 

* . . J . j . ! , , utV1 ^ *? * UC1C ^ in the presentation shoe are linked so that the content 

device having a predetermined display screen size. informal of the second formatting language can be sclec- 

6. Thc system of claim 4, wherein the host system 15 avely OQ ^ device . B B 
comprises an Internet web server that includes at least one 18 . ^ system of cUim 17> wherein me preS entation 
web page thereon, the web page having a specific URL cards include at least one pane portion thereon and wherein 
Internet address and including the content information of the the content information is separated into atomic groups of 
first formatting language. content information, the atomic groups of content informa- 

7. The system of claim 4, wherein the translation server 20 tion being organized and assigned to the pane portions of the 
comprises means for establishing a communications session presentation card so that formatting of the content informa- 
with the host system so that information content of the first tion can be optimized for the display screen on the device, 
formatting language can be received by the translation 19. The system of claim 18, wherein the presentation 
server, means for receiving the information content of the cards in the presentation shoe are transmitted to the device 
first formatting language; means for converting the infbr- 25 50 tnat ^ content information of the second formatting 
mation content of thc first formatting language to informa- language can be displayed on thc device. 

tion content of an intermediate formatting language rccog- ^ c system of claim 7, wherein the device formatting 

nized by the translation server, means for translating the means further comprises means for receiving information 

information content of the intermediate formatting language a device mat indicates the input/output format of the 

to information content information of the second formatting 30 devioe and means for formatting the content information of 

language; means for formatting the content information of ?* e secoi ^ d formatting language based 01a the received device 

the second formatting language so that the content informa- information. 

tion can be transmitted to and displayed on the device. 21 * The svstem ot claim 4, wherein the translation server 

8. The system of claim 7 wherein the information content furthcr comprises a content connection handler. 

of the first formatting language comprises HTML. 35 22. The system of claim 21, wherein the content connec- 

9. The system of claim 7, wherein the intermediate uon handlcr comprises a client connection module config- 
formatting language is XHTML. ured to establish and maintain a communications session 

10. The system of claim 7, wherein thc translation means with me host system; a data processor configured to receive 
comprises an XML engine. mc content information of the first formatting language from 

11. The system of claim 10, wherein the XML engine 40 host svstem » a cleanup module configured to convert the 
comprises a hashtable module for comparing the URL content information of the first formatting language to 
Internet address of the received content information with a content information of the intermediate formatting lan- 
group of predetermined rulesets that define a criteria for means for translating the information content of the 
translating the content information to the second formatting intermediate formatting language to information content 
language; and a transform processor configured to convert 45 mformation of the second formatting language; and means 
the received content information of the intermediate format- for fonDattin 6 content information of the second for- 
ting language to content information of a second intcrmc- matting language so that the content information can be 
diate formatting language. transmitted to and displayed on the device. 

12. The system of claim 11, wherein the second interme- 23 A method for intelligently harvesting information 
diate formatting language comprises RML. 50 from a data s^ce for one or more different information 

13. The system of claim 11, wherein the group of prede- appliances having different input/output formatting 
lermined rulesets define a branding criteria for the content capabilities, comprising: 

information of the second formatting language so that the receiving content information of the first formatting lan- 

content information of the second formatting language dis- guage from an Internet web server; 

played on the device is similar in appearance to that of the 55 converting the received content information of the first 

content information of the first formatting language received formatting language to an intermediate formatting lan- 

from the webpagc. guage so that the received content information can be 

14. The system of claim 13, wherein thc group of prede- processed and formatted in accordance with a display 
termined rulesets comprise XSL rulesets defined by the host capability of a device; 

system. 50 translating the processed content information from the 

15. The system of claim 7, wherein the formatting means intermediate formatting language to content informa- 
com prises a layout engine. tion of the second formatting language; and 

16. The system of claim 13, wherein the layout engine transmitting the content information of the second for- 
comprises a content cutter for extracting portions of the matting language to the device so that the content 
content information of the second intermediate language that 65 mformation can be displayed on the device. 

are not compatible with a display capability of the device so 24. The method of claim 23 further comprising receiving 

that the content information of thc second formatting lan- information from a device that indicates the input/output 
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format of the device so that the translated content is gener- 
ated based on the input/output format of the device. 

25. The method of claim 23, wherein the device is a 
wireless device having a predetermined display screen size. 

26. The method of claim 23, wherein the receiving further 5 
comprises establishing a communications session with the 
host system so that information content of the first format- 
ting language can be received by the translator. 

27. The method of claim 26, wherein the information 
content of the first formatting language comprises HTML. 10 

28. The method of claim 26, wherein the intermediate 
formatting language is XHTML. 

29. The method of claim 23, wherein the translation 
further comprises performing operations with an XML 
engine. is 

30. The method of claim 29, wherein the performing 
operations with an XML engine further comprises compar- 
ing an URL Internet address of the received content infor- 
mation with a group of predetermined rulesets using a 
hashtable module wherein the rulesets define a criteria for 20 
translating the content information to the second formatting 
language; and converting the received content information 
of the intermediate formatting language to content informa- 
tion of a second intermediate formatting language using a 
transform processor. 25 

31. The method of claim 30, wherein the second inter- 
mediate formatting language comprises RML. 

32. The method of claim 30, wherein the group of 
predetermined rulesets define a branding criteria for the 
content information of the second formatting language so 30 
that the content information of the second formatting lan- 
guage displayed on the device is similar in appearance to 
that of the content information of the first formatting lan- 
guage received from the webpage. 

33. The method of claim 32, wherein the group of 35 
predetermined rulesets comprise XSL rulesets defined by the 
host system. 

34. The method of claim 23, wherein the formatting 
comprises using a layout engine. 

35. The method of claim 34, wherein the layout engine 40 
comprises extracting portions of the content information of 
the second intermediate language using a content cutter that 
are not compatible with a display capability of the device so 
that the content information of the second formatting lan- 
guage can be generated from the remaining content infor- 45 
mation of the second intermediate language; dynamically 
generating a layout format for the content information of the 
second formatting language that is optimized for the display 
screen size of the device using a layout processor; and 
generating the content information of the second formatting 50 
language using a protocol processor. 

36. The method of claim 35, wherein the layout format- 
ting comprises a presentation shoe that includes at least one 
presentation card, each presentation card representing a 
display page of the device, each presentation card including 55 
at least a portion of the content information of the second 
formatting language, wherein subsequent presentation cards 
in the presentation shoe are linked so that the content 
information of the second formatting language can be selec- 
tively viewed on the device. 60 

37. The method of claim 36, wherein the presentation 
cards include at least one pane portion thereon and wherein 
the content information is separated into atomic groups of 
content information, the atomic groups of content informa- 
tion being organized and assigned to the pane portions of the 65 
presentation card so that formatting of the content informa- 
tion can be optimized for the display screen on the device. 
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38. The method of claim 37, wherein the presentation 
cards in the presentation shoe are transmitted to the device 
so that the content information of the second formatting 
language can be displayed on the device. 

39. A system for translating content information from a 
first formatting language to a second formatting language so 
that the content information can be displayed on a device, 
comprising: 

a hashtable module for comparing a URL Internet address 
of a web page including the content information of the 
first formatting language with a group of predetermined 
rulesets that define a criteria for translating the content 
information to an intermediate formatting language to 
generate content information of the intermediate for- 
matting language; 

a transform processor configured to convert the received 
content information of the intermediate formatting lan- 
guage to content information of a second intermediate 
formatting language to generate content in the second 
formatting intermediate language; and 

a layout engine configured to translate the content infor- 
mation to the second formatting language from the 
second intermediate formatting language and to format 
the content information for display on the device. 

40. The system of claim 39, wherein the layout engine 
comprises a content cutter for extracting portions of the 
content information of the second intermediate language that 
are not compatible with a display capability of the device so 
that the content information of the second formatting lan- 
guage can be generated from the remaining content infor- 
mation of the second intermediate language; a layout pro- 
cessor for dynamically generating a layout format for the 
content information of the second formatting language that 
is optimized for the display screen size of the device; and a 
protocol processor for generating the content information of 
the second formatting language. 

41. The system of claim 40, wherein the layout format 
comprises a presentation shoe that includes at least one 
presentation card, each presentation card representing a 
display page of the device, each presentation card including 
at least a portion of the content information of the second 
formatting language, wherein subsequent presentation cards 
in the presentation shoe are linked so that the content 
information of the second formatting language can be selec- 
tively viewed on the device. 

42. The system of claim 41, wherein the presentation 
cards include at least one pane portion thereon and wherein 
the content information is separated into atomic groups of 
content information, the atomic groups of content informa- 
tion being organized and assigned to the pane portions of the 
presentation card so that formatting of the content informa- 
tion can be optimized for the display screen on the device. 

43. The system of claim 42, wherein the presentation 
cards in the presentation shoe are transmitted to the device 
so that the content information of the second formatting 
language can be displayed on the device. 

44. The system of claim 39, wherein the group of prede- 
termined rulesets define a branding criteria for the content 
information of the second formatting language so that the 
content information of the second formatting language dis- 
played on the device is substantially similar in appearance to 
the content information of the first formatting language 
received from the webpage. 

45. The system of claim 44, wherein the group of prede- 
termined rulesets comprise XSL rulesets defined by the host 
system. 

46. A method for intelligently harvesting information 
from a data source for display on one or more different 
information appliances, comprising: 
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receiving the information from the data source in a first 
predetermined format wherein the information has pre- 
determined hierarchical relationships; 

storing the received information in a relational markup 
language to convert the received information into a 5 
second predetermined format wherein the content of 
the received information is separated from the relation- 
ships between the received information; and 

outputting information from the second predetermined 
format into a final format for a particular information 10 
appliance having a particular display format. 

47, A layout engine for processing incoming information 
and for generating information that is displayed on one or 
more different information appliances, comprising: 

receiving information to be distributed to the one or more 15 
information appliances, the received information hav- 
ing relationships embedded into the content; 

mapping the receiving information into a relational hier- 
archy based on the relationships embedded into the 
content, the relational hierarchy including one or more 
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atomics containing the content of the receiving infor- 
mation linked to each other based on the relationships 
in the received information; and 
processing the relational hierarchy based on a display 
format of a predetermined information appliance in 
order to generate a series of displays appropriate for the 
predetermined information appliance. 
48. A method for processing incoming information having 
content and relationships embedded into the content, com- 
prising: 

separating the incoming information into one or more 
pieces of content having no relationship information; 

generating an atomic for each piece of content in the 
incoming information; and 

generating a relational hierarchy connecting the atomics 
to each other in a hierarchical relationship based on the 
relationships embedded into the incoming information. 

***** 



05/17/2004, EAST Version: 1.4.1 



